Mon. Not. R. Astron. Soc. 000, 000-000 (0000) Printed 2 February 2008 (MN M?eX style file vl.4) 



o 
o 

(N 



(N 
> 

(N 

o 
o 

Oh; 

6 



The 2dF Galaxy Redshift Survey: The clustering of galaxy 
groups 

Nelson D. Padilla\ Carlton M. Baugh^, Vincent R. Eke^, Peder Norberg^, 
Shaun Cole^, Carlos S. Frenk^ Darren J. Croton^, Ivan K. Baldry^, Joss Bland- 
Hawthorn^, Terry Bridges^'^, Russell Cannon^, Matthew CoUess^'^, Chris Collins^, 
. Warrick Couch^, Gavin Dalton^^'^\ Roberto De Propris^, Simon P. Driver^, George 
Efstathiou^^, Richard S. EUis^^, Karl Glazebrook^, Carole Jackson^, Ofer Lahav^*^, 
Ian Lewis^^, Stuart Lumsden^^, Steve Maddox^^, Darren Madgwick^^, John A. 
Peacock^^, Bruce A. Peterson'', Will Sutherland^^'^^, Keith Taylor^^ 

^Department of Physics, University of Durham, South Road, Durham DHl 3LE, UK. 
"^ETHZ Institiit fiir Astronomie, HPF G3.1, ETH Honggerberg, CH-8093 Ziirich, Switzerland. 
^ Max-Planck-Institat fiir Astrophysik, D-85740 Garching, Germany. 

^Department of Physics & Astronomy, Johns Hopkins University, Baltimore, MD 21118-2686, USA. 
^Anglo-Australian Observatory, P.O. Box 296, Epping, NSW 2111, Australia. 
^Department of Physics, Queen's University, Kingston, Ontario, K7L 3N6, Canada. 

^Research School of Astronomy & Astrophysics, The Australian National University, Weston Creek, ACT 2611, Australia. 
^Astrophysics Research Institute, Liverpool John Moores University, Twelve Quays House, Birkenhead, L14 ILD, UK. 
^Department of Astrophysics, University of New South Wales, Sydney, NSW 2052, Australia. 
Department of Physics, University of Oxford, Keble Road, Oxford 0X1 3RH, UK. 

Space Science and Technology Department, CCLRC Rutherford Appleton Laboratory, Chilton, Didcot, Oxfordshire, OXll OQX, UK. 
^■^ Institute of Astronomy, University of Cambridge, Madingley Road, Cambridge CBS OHA, UK. 
Department of Astronomy, California Institute of Technology, Pasadena, OA 91025, USA. 
Department of Physics and Astronomy, University College London, Gower Street, London, WCIE 6BT, UK. 
Department of Physics, University of Leeds, Woodhouse Lane, Leeds, LS2 9JT, UK. 
School of Physics & Astronomy, University of Nottingham, Nottingham NG7 2RD, UK. 
^"^ Department of Astronomy, University of California, Berkeley, CA 92720, USA. 

Institute for Astronomy, University of Edinburgh, Royal Observatory, Blackford Hill, Edinburgh EH9 3HJ, UK. 



2 February 2008 



ABSTRACT 

We measure the clustering of galaxy groups in the 2dFGRS Percolation-Inferred 
Galaxy Group (2PIGG) catalogue. The 2PIGG sample has 28 877 groups with at 
least two members. The clustering amplitude of the full 2PIGG catalogue is weaker 
than that of 2dFGRS galaxies, in agreement with theoretical predictions. We have sub- 
divided the 2PIGG catalogue into samples that span a factor of w 25 in median total 
luminosity. Our correlation function measurements span an unprecedented range of 
clustering strengths, connecting the regimes probed by groups fainter than galaxies 
and rich clusters. There is a steady increase in clustering strength with group lumi- 
nosity; the most luminous groups are ten times more strongly clustered than the full 
2PIGG catalogue. We demonstrate that the 2PIGG results are in very good agreement 
with the clustering of groups expected in the ACDM model. 

Key words: galaxies: haloes, galaxies: clusters: general, large-scale structure of Uni- 
verse 



1 INTRODUCTION 

Galaxy groups are important tracers of the matter distri- 
bution in the Universe, offering a powerful alternative to 
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individual galaxies. Part of the appeal of galaxy groups lies 
in the simplicity of their relation to the dark matter. The- 
oretically, each dark matter halo yields a single group of 
galaxies. This is a clear advantage over the use of galax;ies 
to trace mass, as the occupation of haloes by galaxies de- 
pends upon halo mass (e.g. Benson et al. 2000; Berlind et al. 
2003). Typical groups are less elitist than the richest clus- 
ters which only account for a few percent of the mass of the 
universe (e.g. Jenkins et al. 2001). The measurement of the 
clustering amplitude of groups as a function of their mass is 
an important probe of hierarchical clustering (Governato et 
al. 1999; Colberg et al. 2000; Padilla & Baugh 2002). 

Previous attempts to measure the clustering of groups 
have been hampered by the small size of the samples. For ex- 
ample, Jing & Zang (1988) measured the redshift space cor- 
relation function of a sample of 135 groups extracted from 
the CfA survey by Geller & Huchra (1983). By the end of 
the 1990s, this situation had only improved slightly with 
an analysis of ~ 500 groups in the Updated Zwicky Cata- 
logue by Merchan, Maia & Lambas (2000). Perhaps unsur- 
prisingly, these early studies were inconclusive. One dispute 
concerned the relative strength of the auto-correlation func- 
tion of groups and of galaxies. Kashlinsky (1987) produced 
an analytic argument suggesting that a sample of groups cor- 
responding to haloes less massive than a chaxacteristic mass 
M* (see Section 2.3 for a definition of M«) should display a 
weaker clustering signal than that of their constituent galax;- 
ios. Similar conclusions can be reached from the calculation 
of the bias of dark matter haloes first carried out by Cole & 
Kaiser (1989), and subsequently by Mo & White (1996) and 
Sheth, Mo & Tormen (2001). However, the first results com- 
paring the clustering of groups and galaxies both confirmed 
(Jing & Zang 1988; Maia & da Costa 1990) and contra- 
dicted (Ramella, Geller & Huchra 1990; Girardi, Boschin & 
da Costa 2000) this theoretical prediction. 

An accurate measurement of the clustering of groups is 
now particularly timely as theoretical models have advanced 
to the point where detailed predictions can be made for the 
clustering of galactic systems. Techniques have been devel- 
oped recently that allow high resolution N-body simulations 
of hierarchical clustering to bo populated with galaxies using 
semi-analytic models (Kauffmann et al. 1999; Benson et al. 
2000; Helly et al. 2003). The models make firm predictions 
for how the luminous baryonic component of the universe is 
partitioned between dark matter haloes. 

A significant advance in statistical studios of group 
properties was made possible by the two-degree Field 
Galaxy Redshift Survey (2dFGRS; CoUess et al. 2001; 2003). 
Merchan & Zandivarcz (2002) used the 2dFGRS 100k data 
release to construct a group catalogue, the 2dFGGC, con- 
sisting of 2198 groups with at least four members. The cor- 
relation function of the 2dFGGC in redshift space can be 
described by a power law with correlation length so = 8.9 ± 
0.3h~^Mpc and slope 7 = —1.6 ± 0.1 (Zandivarcz, Merchan 
& Padilla 2003). The clustering amplitude of 2dFGGC 
groups is significantly higher than that measured for 2dF- 
GRS galaxies, for which so = 6.82 ± 0.28/i"^Mpc (Hawkins 
et al. 2003). 

The completion of the 2dFGRS has made it possible 
to construct a much larger group catalogue, the 2dFGRS 
Percolation Inferred Galaxy Group catalogue (hereafter the 
2PIGG catalogue; Eke et al. 2004a). The gain is bigger than 



the simple factor of just over 2 increase in the number of 
galaxy redshifts; the improved homogeneity and higher spec- 
troscopic completeness means that group finding is far more 
efficient in the final 2dFGRS catalogue. To quantify this im- 
provement, the number of groups with a minimum of four 
members in the 2PIGG catalogue is 7200, over three times 
the number of comparable groups detected by Merchan & 
Zandivarez (2002). Moreover, Eke et al. demonstrate that 
the group finding algorithm can be pushed further still to ex- 
tract groups containing two or more members, which greatly 
increases the number of groups. 

In this paper, we study the clustering of groups in the 
2PIGG catalogue. The properties of the catalogue are sum- 
marized in Section 2, along with an explanation of how we 
construct subsamples of groups defined by mass. The key 
role played by mock catalogues of groups is also outlined 
in Section 2. The estimation of the correlation function of 
groups is set out in Section 3. A number of assumptions need 
to be made in order to measure the clustering of groups and 
these issues arc dealt with in Section 4, in which we use 
mock catalogues extensively to quantify the likely random 
and systematic errors on our meeisurements. The clustering 
measurements for 2dFGRS groups are presented in Section 
5. A summary of our conclusions is given in Section 6. 



2 THE DATA AND SAMPLE CONSTRUCTION 

In this Section we describe the catalogue from which mea- 
surements of the clustering of groups are made, the role 
played by mock datasets in our analysis and the definition 
of subsamples of groups. 

We use the 2PIGG catalogue constructed by Eke et al. 
(2004a). The 2PIGG sample is the largest and most ho- 
mogenous group catalogue currently available and therefore 
provides a unique tool with which to study the clustering 
of galactic systems. The properties of the 2PIGG are out- 
lined in Section 2.1. An important and novel feature of the 
Eke et al. analysis is the extensive use of realistic, physically 
motivated mock catalogues to calibrate the performance of 
the group finding algorithm and to understand how the re- 
covered groups relate to the underlying distribution of dark 
matter. We utilize these catalogues to assess possible sys- 
tematic effects in the measurement of clustering from the 
2PIGG sample and to estimate the errors on our results; a 
brief outline of the mocks is given in Section 2.2. We present 
the rationale behind our definition of subsamples extracted 
from the 2PIGG in Section 2.3. Finally, the choice of rela- 
tion between an observed group property and the underlying 
group mass is justified in Section 2.4. 

2.1 The 2PIGG Catalogue 

The 2PIGG catalogue was constructed from the completed 
2dFGRS as described by Eke et al. (2004a). The group find- 
ing algorithm requires a small number of parameters to be 
set. The motivation for the adopted parameter values and 
the consequences of these choices for the accuracy and com- 
pleteness of the catalogue are discussed at length by Eke et 
al. (2004a). Eke et al. find that after applying the identifi- 
cation algorithm to the 2dFGRS (which contains ~ 190000 
galaxies in the two large contiguous regions), 56 percent of 
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Table 1. The properties of the 2PIGG samples used. A maximum redshift of z = 0.12 is adopted and there are a minimum of two 
galaxies per group. The first column gives the sample label (0 denotes the full group sample), the second gives the lower luminosity 
limit that defines each sample, the third column gives the median luminosity of groups in the sample, the fourth column gives the mean 
number of galaxies per group (the value in brackets gives the median number of group members), the fifth column gives the median 
velocity dispersion along with half the interval containing 68% of the groups, the sixth column gives the median mass of the groups 
(group mass is obtained using eq. 4.7 from Eke et al. 2004a), the seventh column gives the combined number of groups in the NGP and 
SOP regions, the eighth column gives the mean separation of groups and column nine gives the redshift space correlation length. 
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the galaxies are grouped into 28 877 groups containing at 
least two members. These groups have a median velocity dis- 
persion of 190kms"^ if groups of four or more members are 
considered, this value changes to 260kms~^. In both cases, 
the median redshift is 2: = 0.11. The 2PIGG catalogue is 
sufficiently large that it may be divided into subsamplos in 
order to measure trends in clustering strength with group 
properties; the construction of subsamples from the 2PIGG 
catalogue is set out in Section 2.3. 

2.2 Mock 2PIGG catalogues 

Two types of mock catalogue are used in this paper: semi- 
analytic mocks and Hubble Volume mocks. These mocks 
have different underlying physics and play different roles in 
our analysis. 

(i) Semi-analytic mocks. We use the mock 2PIGG cata- 
logues constructed by Eke et al. (2004a). These mocks are 
produced from a high resolution N-body simulation which 
is populated with galaxies using the semi-analytic model, 
GALFORM (Cole et al. 2000; Benson ct al. 2002, 2003). The 
simulation uses standard ACDM parameters, with the nor- 
malization of the density fluctuations given by as = 0.80 
and a primordial spectral index of n = 0.97, in line with the 
constraints from the first year of data from WMAP (Spergel 
et al. 2003). The N-body simulation box is 250/i"^Mpc on 
a side and contains 1.25 x 10* dark matter particles each 
of mass 1.04 x lO^°ft"^M0. We use the z = 0.117 simu- 
lation output, which is close to the median redshift of the 
2dFGRS. Dark matter haloes are identified using a friends- 
of- friends algorithm with a linking length of 0.2 times the 
mean interparticle separation (Davis et al. 1985). 

The halo resolution limit is taken to be 1.04x lO^^/i~^M0. 
The GALFORM code is run for each halo and galaxies arc as- 
signed to a subset of dark matter particles in the halo follow- 
ing the technique described by Benson et al. (2000) . Around 
90% of central galaxies brighter than M^j — 5 logjQ h = 
—17.5 are expected to be in haloes resolved by the simu- 
lation. The effective limit of the catalogue is extended to 
— 5 logjQ ft = — 16 using a separate GALFORM calculation 
for a grid of halo masses below the resolution limit of the 
N-body simulation. These galaxies are assigned to particles 



that are not identified as part of a resolved dark matter halo; 
such particles account for approximately 50% of the dark 
matter in the z = 0.117 simulation output. The luminos- 
ity function predicted by the semi-analytic model is close to 
that estimated for the 2dFGRS (Norbcrg ct al. 2002b): how- 
ever, we force the luminosity function of the model galaxies 
to agree with the 2dFGRS luminosity function by rescal- 
ing the luminosity of the model galaxies. A mock 2dFGRS 
is then extracted by placing an observer at a random loca- 
tion within the simulation cube and applying the radial and 
angular selection function of the 2dFGRS (Norbcrg et al. 
2002b). The simulation box needs to be replicated several 
times to cover the full 2dFGRS volume and geometry. For 
our absolute magnitude limit, the mock 2dFGRS is com- 
plete above z — 0.04. The mock 2dFGRS is then degraded 
by applying the redshift completeness mask, and the mag- 
nitude and velocity errors of the 2dFGRS (Norberg et al. 
2002b). The Eke et al. group finding algorithm is applied to 
the mock 2dFGRS catalogue in exactly the same way and 
with the same parameter values as for the real data. We will 
henceforth refer to the resulting mock 2PIGG catalogue as 
the GALFORM mock. 

The GALFORM mock has three roles to play in our analy- 
sis: (1) To determine which of the observed characteristics 
of 2PIGG systems provides the most reliable estimate of 
the true mass of the underlying dark matter halo in which 
the group resides. (2) To assess the systematic errors in our 
clustering estimates, by comparing measurements from a 
2PIGG mock with the measurement for an equivalent sam- 
ple of galaxy groups extracted from the simulation cube (i.e. 
before applying the 2dFGRS selection criteria and mask). 
This allows us to quantify how well our clustering estima- 
tor compensates for the selection function of the survey and 
its redshift incompleteness. The results of these comparisons 
are used to define sample selection criteria that, when ap- 
plied to the 2PIGG catalogue, will yield the most robust 
and reliable clustering measurements. The construction of 
the equivalent samples is outlined below. (3) To cast the 
predictions of the ACDM model for the clustering of galac- 
tic systems in a form that can be confronted directly with 
the observations. 

Finally, we explain the criteria used to construct sam- 
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pies of groups from the N-body simulation cube that can be 
compared directly with the groups taken from the GALFORM 
mock; these samples will be denoted as equivalent samples. 
We will employ two approaches to construct equivalent sam- 
ples. In the first approach, the spatial abundance of groups 
identified in the GALFORM mock is reproduced in the simula- 
tion cube by selecting an appropriate fraction of the groups 
at each mass. The second method is in two stages. First, 
an effective bias is computed for the groups in the GALFORM 
mock, using Eq. 1 below, and the true mass of each group. 
Second, groups more massive than some lower mass limit 
in the simulation cube are selected, such that the effective 
bias of those groups matches that computed for the mock 
2PIGG sample in the first stage. The correlation functions 
measured from these equivalent samples are compared with 
direct measurements from the GALFORM mock in Section 4.2. 

(ii) Hubble Volume mocks. Eke et al. (2004a) also made 
use of mock 2dFGRS catalogues extracted from the ACDM 
Hubble Volume siirmlation (Jenkins et al. 2001; Evrard et 
al. 2002). In this case, the "galcixies" are dark matter parti- 
cles that are selected in order to have a clustering strength 
similar to that measured for galaxies in the flux limited 2dF- 
GRS (Hawkins et al. 2003). The selection is based upon the 
smoothed density of the dark matter distribution (Cole et 
al. 1998; see also Norberg et al. 2002b). The large volume 
of the Hubble Volume simulation (27Gpc*) allows a high 
rmmbcr of independent mock versions of the 2dFGRS to be 
extracted; the ensemble that we consider contains 22 2dF- 
GRS mocks. The 2PIGG group finding algorithm is run on 
these Hubble Volume mocks to produce a set of groups in 
each mock. The primary aim of these mocks is to provide 
an estimate of the error on measured clustering statistics; 
these errors naturally include the contribution from sample 
variance due to large-scale structure. We compute the rms 
scatter over the ensemble of 22 mocks in the manner de- 
scribed by Norberg et al. (2001). To recap, the correlation 
function measured from one of the mocks is considered as the 
"mean" and the scatter of the remaining mocks around this 
mean is computed. This process is repeated for each mock 
in turn. The rms scatter is the resulting mean scatter. This 
procedure gives an rms scatter that is larger than the formal 
variance over the ensemble of mocks, since, to some extent, 
it takes into account the covariance between the correlation 
function measurements in different bins. The fractional rms 
scatter obtained from the mocks is taken as am estimate of 
the statistical error and is applied to the measured correlar 
tion functions. 

2.3 Subsample definition 

One could consider dividing the 2PIGG catalogue into ei- 
ther cumulative or differential bins in a property related 
to group mass. Naively, one might anticipate that a divi- 
sion into differential mass bins would give a cleaner trend of 
clustering amplitude varying with increasing sample mass, 
since the clustering signal from cumulative samples might be 
dominated by the most massive objects. To investigate this 
prejudice, we use theoretical models of the clustering of dark 
matter haloes and ascertain which of these two alternatives 
is the best way to split up the 2PIGG catalogue. 

The theoretical predictions for halo clustering are based 
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Figure 1. Theoretical predictions for the cfTcctivc bias factor, 
6cff , of a sample of dark matter haloes, as defined by Eq. 1. The 
curves show the expectations for samples of haloes defined using 
either differential (dashed) or cuumlative mass bins (solid), com- 
puted using the formalism developed by Sheth, Mo & Tormen 
(2001; SMT). The horizontal line indicates a bias of unity. The 
thin lines that intersect the curves and run parallel and perpen- 
dicular to the axes indicate the dynamic range of effective biases 
for the samples we use in this paper. The effective bias is com- 
puted from the GALFORM mock and the calculation is described 
fully in Section 2.4. The thin dashed lines show the predictions 
for 2 or more members per group and the thin solid lines show 
the results for a minimum of 4 members per group. 

upon the formalism developed by Cole & Kaiser (1989) and 
Mo & White (1996; hereafter MW). These authors com- 
puted an asymptotic bias for dark matter haloes, using ex- 
tended Press & Schechter (1974) theory and the spherical 
collapse model. Sheth, Mo & Tormen (2001; hereafter SMT) 
improved upon this calculation by incorporating an ellip- 
soidal collapse model. Both approaches have been tested ex- 
tensively against direct predictions from N-body simulations 
(Governato et al. 1999; Colberg et al. 2000; SMT; Padilla & 
Baugh 2002). 

The effective bias for a sample of dark matter haloes is 
computed by weighting the bigis factor associated with each 
individual halo, 6(M), by its abundance in the sample (e.g. 
Baugh et al. 1999; Padilla & Baugh 2002): 

6^^- C^W(dn(M)/dM)dM 
Q{An{M)IAM)AM 

where dn(M)/dM is the space density of haloes in the mass 
interval M to M -I- (5M. In the case of cumulative mass bins, 
the lower mass limit. Mi, defines the sample, as M2 00. 

Fig. 1 shows the theoretical predictions for the effective 
bias as a function of the median mass of the sample. For 
differential mass samples, an interval of one decade in mass 
is assumed. To make these predictions, we have adopted the 
linear theory power spectrum used in the N-body simulation 
described in Section 2.2. The clustering amplitude of the 
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logio(Mt™e/h-' MJ 

Figure 2. The corrected total group luminosity plotted as a func- 
tion of true mass for groups extracted from the GALFORM mock with 
z < 0.12. The grey scale indicates the number of groups, with a 
darker shading indicating a higher number of groups. The solid 
line shows the median, and the dashed lines enclose the interval 
within which 68% of the groups lie. 

overall dark matter distribution corresponds to & = 1. The 
MW theory predicts that haloes with mass below a charac- 
teristic value M* (defined as the mass contained within a 
sphere for which the linear theory variance is equal to the 
threshold for collapse) will display a weaker clustering signal 
than the overall mass distribution, i.e. these haloes will have 
6(M) < 1. 

The dynamic range in mass of the 2PIGG samples that 
we consider is « lO^^h'^M© to lO^h'^M©. Over this in- 
terval, the theoretical predictions for cumulative and differ- 
ential mass bins have similar shapes, so there is no clear 
advantage in using differential mass bins. Moreover, the cu- 
mulative samples benefit from better statistics. For these 
reasons, we will subdivide the 2PIGG sample using cumu- 
lative bins in a group property related to mass (see Section 
2.4). 

2.4 Indicator of group mass 

Our goal in this paper is to estimate the clustering of sub- 
samples of the 2PIGG catalogue, defined by group mass. We 
therefore need to find an observed property of the 2PIGGs 
that displays a tight correlation with the underlying or 
"true" group mass. Following Eke et al. (2004a), we have 
considered two possibilities: (i) A dynamical mass estimate 
that depends upon the rms radius of the group and its esti- 
mated velocity dispersion, (ii) An indirect estimate in which 
the corrected total group luminosity is used to infer a mass 
from the estimated median mass to hght ratio of 2PIGGs. 

From the GALFORM mock catalogue, it is possible to find 
the "true" mass of any group by associating it with a dark 
matter halo in the simulation box. The group's "true" mass 
is simply the mass of the halo, as computed by summing 
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Figure 3. The distribution of true mass for subsamples of mock 
2PIGG groups with at least 2 galaxy members. The upper panel 
shows the mass distributions for subsamples selected by cumu- 
lative total luminosity and the lower panel shows the results for 
samples selected by cumulative bins in dynamical mass; the bins 
are given by the key in each panel. The points with error bars 
show the median and the 10-80 percentile range in true mass for 
each bin. 

the particles identified as members of the halo by a friends- 
of-friends group finding algorithm. Fig. 2 shows the relation 
between corrected total group luminosity and the true group 
mass for the GALFORM mock. The total group luminosity is 
obtained by summing the luminosity of the galaxies identi- 
fied as members of each group, and then applying a correc- 
tion to account for missing galaxies that are fainter than the 
apparent magnitude limit of the 2dFGRS. This correction 
factor is redshift dependent and reaches a factor of ~ 2 at 
z ~ 0.12. In Fig. 2, we only show groups with z < 0.12. The 
shading represents the abundance of groups for a given to- 
tal luminosity and true mass; the darker shading indicates 
a higher number density of groups. The solid line shows 
the median true mass to total luminosity relation, and the 
dashed lines enclose 68% of the groups in each mass bin. This 
relation is reasonably tight over more than two decades in 
true mass. The equivalent comparison between the dynam- 
ical mass estimate and the true mass exhibits much more 
scatter (see Eke et al. 2004b). The dynamical mass estima- 
tor is a particularly poor indicator of true mass for objects 
less massive than lQ^^h~^ Mq. We therefore choose to em- 
ploy the corrected total group luminosity as the indicator 
of the true group mass. However, caution is advisable when 
dealing with very massive groups, since, as is apparent in 
Fig. 2, the interval containing 68% of the groups with masses 
M > 10'^*/i~^MQ broadens considerably. This is due to the 
fact that such large groups, because of their rarity, are more 
likely to be found at higher redshifts, and therefore require 
a larger correction to be applied to the observed group lu- 
minosity to infer the total group luminosity. 

Further evidence supporting our choice of the total 
group luminosity as a mass indicator is given in Fig. 3. Here, 
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we show the "true" mass distributions of samples from the 
GALFORM mock selected according to two criteria: (i) cumula- 
tive bins in total luminosity (top panel) , and (ii) cumulative 
bins in dynamical mass (lower panel). The sample defini- 
tions are given by the key in each panel. These distributions 
are for groups with at legist 2 members. The different lines 
in each panel show the corresponding distributions of true 
group mass for each subsample. The points with error bars 
show the median and the range containing 60% of the groups 
for the true mass distributions. It is clear from this plot that 
the distribution of true mass for samples selected by total 
group luminosity spans a larger dynamic range in mass, and 
the distributions have tighter percentile intervals. Wc have 
performed the same exercise for groups with a minimum of 
four members and reach similar conclusions. However, we 
note that in the case of groups with 4 or more members, 
there are fewer low mass groups as expected. 

We now investigate how the clustering signal from group 
samples varies with the minimum number of group mem- 
bers, nmin, looking at the cases where rimin = 2 or 4. We 
compute an effective bias for each sample using Eq. 1. In 
Fig. 1, the effective bias is plotted at a representative mass 
for each sample (indicated by the linos which intersect the 
bias curves and run parallel and perpendicular to the mass 
axis). This mass is obtained using the GALFORM mocks by 
finding the halo mass above which the effective bias of haloes 
in the simulation cube matches that of the haloes recov- 
ered from the mock group catalogue. Wo show the effective 
bias for the brightest and faintest cumulative luminosity bins 
to illustrate the dynamic range of our clustering measure- 
ments. The dashed lines show the results of this calcular 
tion for nmin = 2 and the solid lines show the case where 
Timin ~ 4. Thc rangc of effective masses (x-axis) and clus- 
tering strengths (y-axis) probed by the different samples is 
widest when considering groups with rimin = 2. 



3 MEASURING THE CLUSTERING OF 
GROUPS 

We now outline the method followed to measure the cluster- 
ing of the 2PIGG samples. In a sample of groups extracted 
from a flux limited galaxy rcdshift survey, thc space den- 
sity of groups is a function of radial distance. An accurate 
estimate of the radial selection function of the sample is re- 
quired to make a robust measurement of thc clustering of 
groups. The procedure that we follow to obtain the selection 
function from the redshift distribution of groups is set out 
in Section 3.1. The weighting scheme used to approximate 
a minimum variance estimate is given in Section 3.2, along 
with the estimator used to infer the correlation function. 



3.1 The radial selection function 



affected by errors in the determination of total group lumi- 
nosity and by contamination, leading to systematic errors in 
the number density of groups of up to a factor of 4. 

The alternative is to estimate the selection function di- 
rectly from the observed rcdshift distribution. Thc concern 
in this case is that the redshift distribution displays features 
that are due to large scale structure (see the histogram in 
Fig. 4). Care must be taken when fitting a parametric form 
to the observed redshift distribution to avoid 'over-fitting' 
the peaks and troughs, thereby inadvertently removing some 
of the clustering signal from the estimated correlation func- 
tion. 

With this concern in mind, wc explore two procedures 
to describe the shape of thc observed rcdshift distribution: 
fitting parametric forms or smoothing the observed redshift 
distribution. We will explore the influence on the measured 
correlation function of these different ways of fitting the red- 
shift distribution in Section 4.1. 

We adopt a fit to Nobsiz), the observed 2PIGG redshift 
distribution, of the form 

Nfi,{z)dz = z"-^ X exp (^^y dz, (2) 
where a and b are parameters set by minimising the quantity 



Xn(z) 



nbins 

{_Nob!,{Zi) 

1=1 



iVfit(2i)) = 



(3) 



Alternatively, we have experimented with including a weight 
proportional to the volume of each redshift bin of width Az, 



Xv,N(z) 



nbins 

y-^ {Nobs{zij 

i=l 



(4) 



Thc effect on thc best fit parameter values of weighting 
by volume is small. From now on, best fit parameter val- 
ues a and b obtained using Eq. 3 (and the rcdshift distri- 
bution constructed using them) will be referred to as un- 
weighted fits, while those found using Eq. 4 will be referred 
to as volume- weighted fits. Thc resulting best fits for the full 
2PIGG catalogues are shown in Fig. 4, as indicated by the 
key in the upper panel. 

The parametric form adopted for the fit is particularly 
attractive since it can be integrated analytically to give the 
cumulative redshift distribution 



Nfit{z)dz 



If 

Jo 



exp 



(5) 



The smoothed redshift distribution is obtained by 
smoothing the observed redshift distribution, which is tab- 
ulated in bins of width Az = 0.0025, with a top-hat win- 
dow in redshift. The smoothed redshift distribution in Fig. 4 
was produced by smoothing the observed distribution with 
a top-hat of width Az = 0.01. 



An accurate estimate of the selection function of groups in 
the 2PIGG samples is required to permit an analysis of the 
clustering signal of the groups. 

The most straightforward way of doing this is to use the 
luminosity function of groups to obtain an estimate of the 
selection function which is unaffected by large scale struc- 
ture. However, Eke et al. (2004c) show, using the GALFORM 
mock, that estimates of the group luminosity function are 



3.2 Estimating the correlation function 

To obtain a minimum variance estimate of the correlation 
function, each group is assigned a weight given by (Efs- 
tathiou 1988): 



i 



1 + 4nn{zi)J3{s)c' 



(6) 
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Figure 4. The redshift distributions of the 2PIGG catalogue (his- 
tograms) ; the upper panel corresponds to the NGP region and the 
lower to the SOP region. The redshift distribution after smooth- 
ing is shown by the dotted line (sec text for details). The dot- 
dashed lines show volume weighted fits (Eqs. 2 and 4), and the 
dashed lines show unweighted fits (Eqs. 2 and 3). 



where we assume J3(s) = 1200^r•■*Mpc^ c is the angular 
completeness of the 2dFGRS and n{zi) is the space density 
of groups at redshift In Section 4.1 we explore the im- 
pact of the choice of Jj on the measured correlation function. 
The space density of groups, n{z), is obtained by integrat- 
ing over the product of the cosmological volume element 
and the adopted description of the redshift distribution; as 
explained above, this could be a smoothed version of the 
redshift distribution or a parametric fit. A catalogue of un- 
clustered points is produced with the same radial and angu- 
lar selection functions as the data. The number of random 
points is set to be 10 times the number of data points, with 
a lower limit of 100000. In the case of the unclustered cat- 
alogue, the weights arc scaled in order to ensure that their 
sum matches that of the weights of the observed groups: 



; WR„ 



E 



(7) 



where R and D indicate unclustered points and data groups 
respectively. 

We adopt the Landy & Szalay (1993) estimator for cal- 
culating the redshift-space correlation function. 



^(s) = {DD{s) - 2DR{s) + RR{s))/RR{s), 
where 

XY{s) = Wx,iWY,j , 



(8) 



with X and Y representing the actual groups (D) and/or 
points in the random catalogue (R); i and j are the indices 
for each individual group/random point, and w corresponds 
to the weight defined above. 

We use the same estimator to obtain the correlation 
function as a function of pair separation perpendicular (cr) 
and parallel (tt) to the line-of-sight, ^(cr, tt). Prom this cor- 
relation function, we can define the projected correlation 
function 



5(o-, 7r)do-. 



(9) 



This quantity is free from redshift-space effects arising from 
peculiar motions and can be related directly to the real space 
correlation function (e.g. Norberg et al. 2001). 



4 TESTING THE ESTIMATION OF THE 
CORRELATION FUNCTION 

The measurement of the clustering of groups requires a num- 
ber of choices to be made that can have an impact upon the 
estimated correlation function. In this section, we carry out 
a careful examination of our method for measuring the clus- 
tering of the mock groups, with the goal of determining a 
correlation function that is accurate, i.e. with minimal sys- 
tematic errors, and with random measurement errors that 
are as small as possible. The mock catalogues described in 
Section 2.2 play a key role in this exercise. The GALFORM 
mock is used to assess systematic errors by comparing the 
correlation functions measured from the semi-analytic mock 
group catalogue with the correlation function of an equiva- 
lent sample in the simulation cube (the construction of such 
a catalogue is described in Section 2.2). We stress that the 
goal of quantifying the systematic errors is to devise a clus- 
tering measurement algorithm that minimizes these errors 
rather than to correct for them. The Hubble Volume mocks 
arc used to estimate the random measurement errors, which 
automatically include the contribution of sample variance 
from large scale structure. 

An important test for systematic errors in our clustering 
measurement is to compare the correlation function recov- 
ered from a mock group catalogue with that of a comparable, 
equivalent sample drawn from the distribution of groups in 
the simulation cube. 

The rest of this section is split into two parts. In the 
first, we present a systematic study of how various assump- 
tions and approximations afi^ect the recovered correlation 
function. We focus attention on the size of the random mear 
surement errors and on minimizing systematic errors in the 
estimated correlation function. In the second part of this 
section, we build on the conclusions of the first part and 
apply an optimal method to measure the clustering of mock 
2PIGGs to determine how well we cam recover trends in clus- 
tering amplitude for diflferent subsamples of groups. 



4.1 An optimal measurement of clustering 

The main results of this subsection arc presented in Fig. 5. 
Each plot in this figure is divided into two panels. The top 
panel in all cases compares the correlation function recov- 
ered from a GALFORM mock group catalogue with the cor- 
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Figure 5. The accuracy with which the correlation function can be measured from the GALFORM mock catalogue for different assumptions 
and approximations. In each plot, the upper panel shows the systematic error on the measured correlation function, expressed as the 
ratio of the correlation function measured from a mock 2PIGG sample to that measured from an idealized, comparable sample. The 
errors on this ratio come from the rms scatter over the ensemble of Hubble Volume mocks; these are used to plot the logarithm of the 
relative errors in the lower panel of each plot. The different plots show how well the correlation function can be measured when the 
following are varied: (a) The sample definition, as given in the text: the solid lines shows results for a sample of groups brighter than a 
minimum luminosity, the dotted line shows a sample of groups more massive than a minimum dynamical mass limit and the dashed line 
shows the case of groups within a differential luminosity bin. (b) The minimum group membership. The thinner lines show results for 
a brighter sample of groups (L > lO^'^^h^^ Lq. (c) The description of the redshift distribution used to model the selection function of 
groups, (d) The maximum redshift used to define the catalogue; again the thinner lines show results for a brighter sample. 
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relation function of an equivalent sample, drawn from the 
simulation cube, of groups generated by GALFORM. The er- 
rors on this ratio are the rms scatter over the ensemble of 
22 Hubble Volume group mocks. Note that this estimate 
of errors includes cosmic variance, and therefore overesti- 
mates the error bars in a comparison between results from 
the mock and the simulation cube, since both come from the 
same simulated volume. In the absence of systematic errors 
in the clustering measurement, this ratio would be consis- 
tent with unity, indicated by the horizontal line. The lower 
panel in each plot shows the logarithm of the relative error 
on the measured correlation function, as deduced from the 
rms scatter over the Hxibblc Volume mocks. The horizontal 
line indicates an error of 100%. To improve the clarity of the 
presentation, we have used a power law fit to the correlation 
function for separations larger than 8h~^Mpc in the defi- 
nition of the relative error, which avoids dividing by small 
or negative numbers. The correlation function of the equiva- 
lent sample of groups falls below a power law on large scales, 
typically s ^ 15^~^Mpc, so the plotted error is a lower limit 
to the true relative error on these scales. We now go through 
the set of choices or assumptions that needs to be made in 
our clustering measurement method, directing the reader to 
Fig. 5 where appropriate. 

(i) Choice of 2dFGRS region. The differences between 
the correlation functions measured in the NGP and SGP 
regions provide a crude estimate of the errors; these differ- 
ences are comparable to the errors inferred from the Hubble 
Volume mocks. We find that the correlation functions mea- 
sured in mock NGP and SGP 2dFGRS regions are consistent 
within the errors. Combining the pair counts of groups in the 
two regions reduces the Poisson noise and sample variance 
in the clustering measurement. Henceforth, our results will 
be for combined NGP and SGP samples. 

(ii) Choice of J3 in the radial weight. The weight as- 
signed to each group to compensate for the radial selection 
function requires a value for the integral of the two-point 
correlation function, J3(s) (Eq. 6). We have explored using 
constant values in the range 400 < Js/h-^Mpc^ < 2400, and 
find that the changes in the estimated correlation function 
are small and within the statistical errors. We adopt .J3 = 
1200h~^Mpc^, which corresponds to the typical asymptotic 
value of Jsis) for separations s > 10h~^Mpc in the Hubble 
Volume mocks. 

(iii) Sample definition. Fig. 5(a) compares the corre- 
lation function measured for samples defined in different 
ways: by setting (i) a lower limit on total group luminosity, 
L > 10'^'''^h~^Lo (solid line), (ii) a lower limit on dynami- 
cal mass, Afdyn > lO^^ ^/r^M© (dotted line), which, for the 
median mass-to-liglit ratio of the 2PIGGs (Eke ct al. 2004b), 
corresponds to the luminosity cut in (i), and (iii) a differ- 
ential luminosity sample defined by 10^°-^^ < L/h~^LQ < 
■|^gio.66 ((jashed line). The correlation function measured for 
the sample of groups brighter than the luminosity threshold 
in (i) has the smallest systematic and random errors. This 
adds further weight to the preference arrived at for cumula- 
tive bins in luminosity in Section 2.4. The three remaining 
panels. Fig. 5(b), (c) and (d) show the errors on ^(s) for 
samples of mock groups with L > lO^^'^h^^I/Q (shown by 
solid lines), and, in some cases for a brighter sample with 
L > 10^°-^h~^MQ (indicated by dashed lines). 

(iv) Minimum number of group members. We compare 



Figure 6. The relative errors on our measurement of ^(s), ob- 
tained by dividing the rms scatter in ^(s) from the Hubble Volume 
mocks by the mean value of i;(s) for s < Sh^^Mpc, and a fit to 
the mean ^(s) (over 8 < s/{h-^Mpc) < 15) for s > Sh-^Mpc. 
The horizontal black line shows the 100% error limit. Different 
line types correspond to different subsamples as indicated by the 
key. 



the accuracy of the correlation function for groups with a 
minimum number of members of Wmin ~ 2 (solid lines) and 
?^min ~ 4 (dashed lines) in Fig. 5(b). The systematic errors 
on the correlation functions recovered from the mocks are 
roughly comparable in the two cases. The random errors, 
however, are smaller for the case of nmin = 2. This is ex- 
pected since this sample contains more groups. We recall 
that samples of groups with Umin = 2 span a wider range in 
mass (see Fig 3 in Section 2.4). The dashed lines in Fig. 5(b) 
show the results for a brighter sample of groups, as explained 
at the end of (iii) above. This sample contains fewer groups 
and so the random errors are somewhat larger than before, 
particularly at small separations. The systematic errors in 
the correlation function recovered from the mock 2PIGG 
also become significant below s fs 2/i~^Mpc for the brighter 
sample. 

(v) Selection function model. The impact on the corre- 
lation function of using different approximations to the form 
of the redshift distribution of groups to model their selection 
function is shown in Fig. 5(c). The solid line shows how well 
the correlation function can be measured using a paramet- 
ric fit to the redshift distribution of groups in the mock, as 
described in Section 3.1; the dotted line shows the results 
when a volume weighting is applied to determine the best fit 
parameters. On small scales, s < 3/i^^Mpc, the systematic 
and random errors are very similar in these two cases. On 
larger scales however, the unweighted fit results in a more 
faithful and less noisy measurement of the correlation func- 
tion. The evidence in this panel shows that the approach of 
using the smoothed redshift distribution of groups to model 
their selection function (dashed line) is clearly flawed, lead- 
ing to a significantly discrepant estimate of the correlation 
function for separations s > l/i~^Mpc. 
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Figure 7. The correlation functions measured in the GALFORM 2PIGG mock for samples defined by lower group luminosity limits 
(symbols with error bars). In the left hand column, results are shown for nmin = 2 and in the right for rimin = 4. These measurements are 

compared with the correlation functions of equivalent samples of groups taken from the simulation cube (lines). Two different methods 
of constructing these equivalent samples have been used. In the top row, the equivalent samples have the same mass function as the 
groups in the sample from the mock 2P1GG catalogue. In the bottom row, the equivalent samples are complete above some halo mass, 
chosen so that the groups have the same effective bias as those recovered from the mock 2P1GG catalogue. 



(vi) Sample redshift limit. An increase in the redshift 
limit of the sample leads to a larger volume and the associ- 
ated increase in the size of the sample. However, these addi- 
tional groups come at the expense of requiring a larger cor- 
rection to the observed group luminosity to deduce the total 
group luminosity. The consequences of varying the majd- 
mum redshift limit of the sample are presented in Fig. 5(d), 
where the cases ^max = 0.12 (light lines) and Zmax = 0.15 
(dark lines) are considered. There is a negligible difference 
in the random measurement errors for these redshift limits. 
The correlation function is more accurately recovered for 

2^max — 0.12. 

The conclusions of this subsection are that an optimal 



measurement of the clustering of the 2PIGG catalogue will 
be obtained under the following conditions: (i) The pair 
counts in the NGP and SGP regions of the 2dFGRS are 
combined, (ii) The value of J3 is taken to be 1200h~^Mpc^, 
though the results are fairly insensitive to the exact value, 
(iii) Cumulative bins in luminosity are used to define sub- 
samples of groups from the 2PIGG catalogue, (iv) ramin = 2 
is used, which leads to the most reliable measurements, (v) 
The most accurate model of the selection function is ob- 
tained by a simple parametric fit to the observed group red- 
shift distribution, (vi) A conservative redshift of Zmax = 0.12 
is used. 
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Figure 8. The correlation function, ^(c, vr), for selected 2PIGG samples, as indicated by the label on each panel. The grey scale indicates 
the amplitude of the correlation function, with darker shading indicating stronger clustering: ^(tr, 7r) = 0.2, 0.4, 0.6, 0.8, 1, 1.2, 1.4, 2, 3 and 
5 are shown. The ^ = 1 contour is also marked by a thick black line. The thin bla<;k lines show the expected shape of the ^(tr, tt) contours 
in the absence of redshift space distortions of the clustering pattern. 



4.2 The clustering of subsamples of groups 

We now investigate how well we can distinguish between the 
clustering signals displayed by different samples of groups 
defined by mass. We also test our clustering measurement 
algorithm to see how faithfully the clustering signal can re- 
produce an idealized measurement. This is an important 
consideration, as this determines the strength of the con- 
straints wc can place on theoretical models of the clustering 
of galactic systems. 

The errors on the measured correlation function are ob- 
tained from the rms scatter over the ensemble of Hubble 
Volume mock 2PIGG catalogues. We show these relative er- 
rors in Fig. 6. The horizontal line marks an error of 100%. 
This plot indicates that we should be able to get a robust 



measurement of the correlation function out to pair separa- 
tions of s ~ 30/i~^Mpc. The small scale limit depends more 
upon the sample under consideration, due to the changing 
space density of groups. For the full group sample, the corre- 
lation function can be measured down to separations below 
s ~ l/i~^Mpc; for the most luminous sample, which has a 
much lower space density, the smallest pair separation that 
can be probed is s w 3/i~^Mpc. 

To test the accuracy of our clustering measurements, 
we have extracted samples of groups from the GALFORM mock 
catalogue defined by a scries of lower limits in total group 
luminosity as discussed earlier. The clustering measured for 
these samples is compared with that of samples drawn from 
the simulation cube using two different criteria to construct 
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equivalent samples as discussed earlier. The samples drawn 
from the simulation cube are idealized in the sense that they 
do not have a radial or angular selection function imposed 
upon them as is the case for the mock 2PIGG catalogue. 
Moreover, the groups occupy dark matter haloes identified 
by a friends-of-friends group finder. The mass of a group in 
the simulation cube is known very accurately: it is simply 
the sum of the mass of the dark matter particles connected 
by the group finder. 

The correlation functions measured for mock group sub- 
samples defined by a lower limit on total group luminosity 
are compared with the results for the equivalent samples 
drawn from the simulation cube in Fig. 7. The top row of 
this figure shows the comparison when the equivalent sam- 
ples axe set up to reproduce the mass function of groups 
recovered from the mock group catalogue for each luminos- 
ity subsample. The bottom row shows the comparison for 
equivalent samples constructed to match the effective bias 
estimated from the mock subsamplcs. The left hand column 
shows the results for groups with a minimum membership 
nmin = 2, and the right hand column for nmin = 4. The 
clustering measurements from the mock subsamples are in 
impressively good agreement with the results obtained for 
the equivalent samples in the simulation cube, particularly 
for the case of nmin ~ 2. The equivalent samples constructed 
to reproduce the effective bias of mock group subsamples are 
in somewhat better agreement with the measurements from 
the mocks for the brighter group samples. 

The agreement in clustering amplitude between samples 
of groups taken from a mock and those extracted from an 
idealized simulation is significant; it means that wo fully 
understand the impact of making a mock group catalogue 
on the measurement of the clustering of groups and can use 
the clustering of 2PIGG samples to constrain theoretical 
models of structure formation. 



5 2PIGG RESULTS 

In this section we measure the clustering amplitude of the 
real 2PIGGs for samples defined by total group luminosity. 
Some bEisic properties of the samples are given in Table 1. 
We apply the optimal clustering measurement algorithm set 
out at the end of Section 4.1. We combine the pair counts in 
these regions to estimate the correlation function. We have 
checked that the measurements for the individual regions 
agree within the errors. 

An important test of the integrity of a group catalogue 
is the form of the correlation function measured in bins of 
pair separation parallel (tt) and perpendicular (a) to the 
line of sight, ^(cr, tt). Padilla & Lambas (2003a,b) explored 
different techniques used in the construction of group and 
cluster catalogues. They found that the shape of ^(cr, tt) is a 
powerful way to reveal spurious agglomerations of galaxies. 
A tell-tale sign of problems with the composition of galaxy 
groups is a significant enhancement of the correlation func- 
tion, ^((j, tt), in the tt direction, as seen in the early analyses 
of Abell catalogue clusters (e.g. Bahcall & Soneira 1983; 
Sutherland 1988). Such a distortion of the clustering pat- 
tern is seen for 2dFGRS galaxies, due to the large peculiar 
motions inside massive virialised structures (Peacock et al. 
2001; Hawkins et al. 2003). However, for the case of groups 
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Figure 9. The projected correlation function of selected 2PIGG 
samples. The samples are indicated by the key; error bars axe 
plotted on the measurements for selected samples for clarity. The 
solid line shows a fit to the projected correlation function of 2dF- 
GRS galaxies talsen from Hawkins et al. (2003). 

and clusters, such virialised structures are not predicted in 
viable models of structure formation (Kaiser 1987; Padilla et 
al. 2001; Padilla & Baugh 2002). The correlation function, 
5(<T, tt), for samples of 2PIGGs is shown in Fig. 8. There is 
no enhancement of the clustering signal in the tt direction. 
The contours of constant clustering amplitude are, however, 
affected by peculiar motions, showing the flattening due to 
the infall motions expected in the gravitational instability 
paradigm (Padilla et al. 2001; Peacock et al. 2001). There 
is also a clear increase in clustering amplitude with sample 
luminosity. 

The projected correlation function, E{a)/a, is the in- 
tegral of ^((j, tt) over the tt direction (e.g. Norberg et al. 
2001). This statistic is unaffected by peculiar motions. In 
Fig. 9, we show the projected correlation functions for se- 
lected samples of 2PIGGs. The black line shows the best fit 
to the projected correlation function of galaxies in the 2dF- 
GRS for comparison (Hawkins et al. 2003). We find that the 
lowest luminosity sample plotted is less strongly clustered 
than the galaxy distribution, a point to which we return 
later on in this section. The brightest sample of groups dis- 
plays a clustering amplitude that is an order of magnitude 
stronger than that measured for 2dFGRS galaxies. 

The projected correlation function, due to the way it 
is calculated, can be unreliable if the correlation function 
is noisy on large scales. A more robust quantity in such 
cases is the redshift space correlation function, ^(s). This 
quantity is the average of ^(ct, tt) within annuli centred on 

s = (^a^ + TV^Y ^. The redshift space correlation function 
measured for 2PIGG samples with membership nmin > 2 is 
plotted in the left-hand panel of Fig. 10. For reference, we 
also plot the redshift space correlation function of 2dFGRS 
galaxies measured by Hawkins et al. (2003). The correlation 
function of the group samples has a similar shape to that ob- 
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Figure 10. Left panel: The redshift space correlation function, £,{s), measured for samples of 2PIGGs. The samples are shown by the 
key. Error bars are plotted for selected samples for clarity. The solid line shows ^(s) for 2dFGRS galaxies, measured by Hawkins et al. 
(2003) . Right panel: The ratio of the group correlation function to the galaxy correlation function, plotted on a logarithmic scale. 



tained for galEixies, on scales where a comparison is reliable. 
There is, however, a dramatic change in clustering amplitude 
with group luminosity, mirroring that seen for the projected 
correlation function in Fig. 9. These points are emphasized 
in the right hand panel of Fig. 10, in which we plot the group 
correlation function divided by the galaxy correlation func- 
tion. There is a relative bias between the spatial distribution 
of groups and galaxies. For the full 2PIGG sample, and also 
for the next faintest luminosity sample, this relative bias is 
actually an anti-bias; those groups are more weakly clus- 
tered than the galaxies. The brightest groups, on the other 
hand, are almost ten times more strongly clustered than the 
galaxy distribution. 

A useful summary of the clustering measurements can 
be made by plotting the correlation length, so, as a function 
of the mean group separation, rfc, which is related to the 
space density of groups, n, by dc = The space den- 

sity of groups is estimated using the cumulative bj— band 
luminosity function of 2PIGGs out to 2; = 0.12 (Eke et al. 
2004b). The space density of each subsample is simply the 
luminosity function evaluated at the appropriate lower limit 
in luminosity. We estimate the correlation length (defined 
by C('So) = 1) by fitting a power law to the measured red- 
shift space correlation function for pair separations in the 
range 0.4 < logiQ(s/h~^Mpc) < 1.3. Our results are fairly 
insensitive to perturbations to this range. 

The clustering strength of 2PIGG samples versus mean 
group separation is plotted in Fig. 11. There is an increase 
in clustering strength with the mean separation of groups 
in each sample; the clustering strength increases slightly 
less rapidly than the change in group separation. There 
is very good agreement between the results for rimin = 2 
and nmin = 4 for group separations for which a compari- 
son is possible. Fig. 11 also shows the so-dc relation in the 



GALFORM mock 2PIGG catalogue with rimin = 2, which is in 
very close agreement with the results from the real 2PIGG 
samples. The solid and dotted linos show the measurements 
for equivalent samples of groups drawn from the simulation 
cube. The solid line gives the so-dc relation in redshift space 
and the dotted line shows the real space results. The clus- 
tering amplitude is typically 10%-20% weaker in real space 
than in redshift space. 

The 2PIGG results are compared with a selection of 
measurements taken from the literature in Fig. 12. This com- 
parison between datasets should be treated with caution due 
to a number of differences in the way the various samples 
have been defined and analysed: namely, in decreasing order 
of seriousness: (i) the technique used to identify groups and 
clusters, (ii) the derivation of a value for the mean inter- 
group separation and (iii) the calculation of the correlation 
length. 

Zandivarez et al. (2003) measured the clustering of 
groups in a catalogue constructed from the 100k release of 
2dFGRS data by Merchan & Zandivarez (2002). The two 
most abundant samples of groups analysed by Zandivarez et 
al. have a significantly higher clustering amplitude than we 
recover for 2PIGG groups of comparable abundance. Zandi- 
varez et al. estimated dc from a dynamical estimate of the 
group mass which they translated into a spatial abundance 
using the measured mass function of groups (Martinez et 
al. 2002). Moreover, Zandivarez et al. only consider groups 
with riniin = 4; such groups are probably only associated 
with low mass systems through errors in the dynamical mass 
estimates. The abundance of groups in the Updated Zwicky 
Catalogue was estimated in a similar way by Merchan et 
al. (2000). Bahcall et al. (2003) have applied a photometric 
technique (Annis et al. 2004) to a subset of the Sloan survey 
data to obtain a sample of clusters that overlaps with the 
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Figure 11. The correlation length in redshift space, sq, plotted 
as a function of the mean group separation, dc- The 2PIGG re- 
sults are shown by filled squares for rimin = 2. Also shown for 
comparison are measurements from the GALFQRM mock plotted 
with triangles. The solid line shows the so — dc relation obtained 
from the simulation cube for equivalent samples of groups (see 
text). The dotted line shows how these results change when the 
correlation length is measured in real space. 



1.6 



1.4 

o 

? 1.2 



1 



O 



0.6 



■ti 2dFGRS (Norberg et al. 8002) 

■ 2PIGG 

b/b.=0.85 + 0.15 L/L. 

s„=1.67 d;»= (L>10i°-2h-2Ls) 

s„=2.34 d;^= (all points) 





1 1.5 
Iog,„(dyh-iMpc) 



2 



Figure 13. A comparison of the SQ—dc relation for 2PIGG groups 
(squares) and for 2dFGRS galaxies (stars, taken from Norberg et 
al. 2002a). The solid line is a fit to the 2PIGG clustering data, 
excluding the lowest luminosity sample; the dotted line is a fit to 
the 2PIGG results including the faintest sample. The dashed line 
shows the fit to the clustering strength-luminosity relation fitted 
to 2dFGRS data by Norberg et al. (2001). 
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Figure 12. A comparison of the SQ — dc relation for different sam- 
ples. The squares show the results from the 2PIGGs for Wmin = 2. 
The other symbols show a selection of data taken from the liter- 
ature, as indicated by the key. The lines show fits to subsets of 
the data, with sources indicated by the key. 



most luminous 2PIGG groups and with APM Survey clus- 
ters (Croft et al. 1997; Dalton et al. 1997). The Sloan mea- 
surements are in reasonable agreement with the results from 
APM clusters and are marginally lower than the 2PIGG val- 
ues at comparable abundances. However, both the SDSS and 
APM clusters are identified in projection. The trend of clus- 
tering strength versus abundance found for 2PIGGs agrees 
quite well with the measurements from X-ray selected clus- 
ter samples (Abadi, Muriel & Lambas 1998; Collins et al. 
2000; Lee & Park 2002). 

The dotted line in Fig. 12 shows the so-dc relation advo- 
cated by Bahcall & West (1992), whereas the dot-dashed line 
is the relation proposed by Bahcall et al. (2003); the dashed 
line is the fit of Zandivarez et al. (2003) to their data. The 
analogous fit to the 2PIGG results is plotted in Fig. 13 for 
clarity. If the faintest groups are ignored, the 2PIGG results 
are described by the relation sq = 1.67d2'®^. This is some- 
what steeper than the fit of Bahcall et al. (2003). 

We compare the clustering of 2PIGG groups with 2dF- 
GRS galaxies in Fig. 13. The clustering amplitude of 2dF- 
GRS galaxies is taken from Norberg et al. (2002a), who anal- 
ysed volume limited samples defined by galaxy luminosity. 
The curve plotted over the galaxy clustering measurements 
shows the predictions of the simple relation between relative 
clustering strength and luminosity deduced by Norberg et 
al. (2001). The full sample of 2PIGGs is more weakly clus- 
tered than L« and brighter galaxies. On the other hand, the 
trend of clustering strength with pair separation is stronger 
for groups than it is for galaxies. 
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6 CONCLUSIONS 

We have measured the clustering of groups in the 2PIGG 
sample constructed from the 2dFGRS by Eke et al. (2004a). 
The group catalogue, made up of galactic systems with a 
minimum of two members, contains ~29,000 groups, out of 
which ~16,000 arc at 2: < 0.12. The sample is sufficiently 
large and homogenous that a robust measurement of cluster- 
ing is possible for subsamples of groups defined by their cor- 
rected total luminosity. Our analysis has relied extensively 
on the use of various mock group catalogues constructed 
using high resolution N-body simulations populated with 
galaxies using the GALFORM semi-analytic model (Cole et al. 
2000; Benson et al. 2000, 2003). 

In summary, the main conclusions we have reached are 
the following: 

(i) The main goals of this paper were to make a robust 
measurement of the clustering of galaxy groups with the aim 
of exploring the dependence of the clustering amplitude on a 
property related to group mass (for example the total group 
luminosity) and to compare the results with theoretical pre- 
dictions. We have tested our algorithm for estimating the 
correlation function by comparing results obtained from a 
mock 2PIGG catalogue with those derived from equivalent 
samples drawn from the N-body simulation cube. The suc- 
cess of this comparison is significant for two reasons. Firstly, 
in order to make a robust measurement of the clustering of 
groups from a flux limited galaxy survey, it is necessary to 
maJie a number of approximations and to choose certain 
parameter values (see Section 4.1). We have tuned the al- 
gorithm for measuring the clustering of 2PIGG groups by 
requiring a close match between the correlation functions 
estimated from the mocks and the original results from the 
simulation cube. Secondly, the ability to extract equivalent 
samples of groups from a simulation volume with the same 
clustering as samples taken from mock 2PIGG catalogues 
makes it possible to interpret the 2PIGG clustering results 
in the context of structure formation models. 

(ii) We find that clustering amplitude increases substan- 
tially with total group luminosity. The 2PIGG catalogue al- 
lows clustering measurements to be made for samples span- 
ning a factor of 25 in median total luminosity. The correla- 
tion function increases in amplitude by an order of magni- 
tude over this luminosity interval. Another way of expressing 
this is that the redshift space correlation length changes by 
a factor ~ 3.5 from the faintest to the brightest sample. The 
most luminous groups we consider have a mean pair separa- 
tion that is « 5.5 times greater than that of the full group 
sample. There is little change in the shape of the correlation 
function with group luminosity. 

(iii) The shape of the redshift space correlation function 
of groups is very similar to that measured for 2dFGRS galax- 
ies on the scales for which a comparison is possible. The full 
2PIGG sample has a weaker clustering amplitude than is 
measured for 2dFGRS galaxies; the correlation length of the 
2PIGG sample with rimin = 2 is so = 5.5±0.1/t^^Mpc, whilst 
that of 2dFGRS galaxies is 6.82 ± 0.28/i"^Mpc (Hawkins et 
al. 2003). However, the clustering amplitude of brighter sam- 
ples of groups is much greater than that of 2dFGRS galaxies. 

(iv) The correlation functions measured for 2PIGG sam- 
ples are in very good agreement with the predictions of a 
semi-analytic model of galaxy formation, the GALFORM model 



(Cole et al. 2000; Benson et al. 2003), in a cold dark mat- 
ter universe with a cosmological constant (ACDM). Previous 
work has compared the predictions of the ACDM model with 
measurements of the correlation length versus abundance re- 
lation for rich clusters (Governato et al. 1999; Colberg et al. 
2000) . To extend the theoretical calculations down to group- 
sized systems, it is essential to extend the predictions of the 
spatial distribution of dark matter haloes in the ACDM cos- 
mology with a model for galaxy formation. The galaxy for- 
mation model predicts how the dark matter haloes should 
be populated with galaxies. For this reason, the clustering of 
groups provides a more stringent test of theories of galaxies 
formation than the clustering of the richest clusters. 

(v) The trend of correlation length (measured in redshift 
space) against mean inter-group separation can be quan- 
tified as: So = 1.67d° ®^. The 2PIGG catalogue has made 
possible a robust measurement of the clustering of galactic 
systems, ranging from poor groups to rich clusters, from one 
sample for the first time. Our results are in general agree- 
ment with those obtained previously for rich groups and 
clusters. 

Galaxies and groups of galaxies trace the underlying 
distribution of dark matter haloes in different ways. Galax- 
ies trace the halo distribution in a complex way. The halo 
occupation distribution (HOD), which depends strongly on 
halo meiss, has been used extensively to constrain theoreti- 
cal models (e.g. Benson et al. 2000; Peacock & Smith 2000; 
Seljak 2000; Berlind et al. 2003). In principle, the group 
sample should trace the halo distribution in a simple fash- 
ion, with every halo above some mass spawning one galaxy 
group. However, in practice, the identification of groups in 
real surveys is complicated, making this ideal difficult to 
attain. Because of this it is essential to devise a scheme to 
quantify the fidelity of a group catalogue and to uncover any 
biases in comparisons with theoretical models by applying 
the group finder to mock catalogues (Eke et al. 2004a). 

The difference between the clustering amplitude mea^ 
sured for the full 2PIGG and the full 2dFGRS galaxy sam- 
ples can be readily understood in terms of the HOD. Ideally, 
the groups have a one-to-one correspondence with the un- 
derlying dark matter haloes within some mass interval. The 
galaxies obviously sample haloes in the same mass range, but 
with a weight that increases with halo mass, since the most 
massive haloes host more than one galaxy. Thus, the more 
massive haloes, which are the more strongly clustered, make 
a larger contribution to the correlation function of 2dFGRS 
galaxies than they do in the case of the full 2PIGG sample. 
The clustering of galaxy groups as a function of group lumi- 
nosity therefore provides an important new test of theories 
of galaxy formation. 
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